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Abstract — A write-once memory (worn) is a storage medium 
formed by a number of "write-once" bit positions (wits), where 
each wit initially is in a '0' state and can be changed to a '1' state 
irreversibly. Examples of write-once memories include SLC flash 
memories and optical disks. This paper presents a low complexity 
coding scheme for rewriting such write-once memories, which 
is applicable to general problem configurations. The proposed 
scheme is called the position modulation code, as it uses the 
positions of the zero symbols to encode some information. The 
proposed technique can achieve code rates higher than state-of- 
the-art practical solutions for some configurations. For instance, 
there is a position modulation code that can write 56 bits 10 
times on 278 wits, achieving rate 2.01. In addition, the position 
modulation code is shown to achieve a rate at least half of the 
optimal rate. 

Index Terms — Write-once memories, flash memories, position 
modulation. 



I. Introduction 

In their pioneering work ifTSl . Rivest and Shamir considered 
the problem of rewriting a "write-once memory" (wom). A 
write-once memory consists of a number of "write-once" cells, 
where each wit initially is in a '0' state and can be changed 
to a '1' state irreversibly. Several types of storage media 
follow such a write-once memory model. Examples include 
SLC flash memories and optical disks. Rivest and Shamir lITSl 
demonstrated that it is possible to rewrite such a write-once 
memory multiple times, using coding techniques. For example, 
using 3 wits, we can write a 2-bit variable twice via a wom- 
code. The rate of this wom-code (i.e., capacity per wit) is 
2x2/3 = 1.33. 

In lITSl . Rivest and Shamir presented a wom-code that has 
the best asymptotic rate (as the cardinality of the variable ap- 
proaches infinity), for any given number of writes. This result 
is done via a counting argument and the scheme is not con- 
structive. Rivest and Shamir and others have proposed various 
other wom-codes, which have low complexity. However, the 
coding rates of existing wom-codes with low complexity are 
still far from the theoretically achievable coding rates. Thus, 
the problem of finding good practical wom-codes remains an 
open challenge. 

In this paper, we present a low complexity coding scheme, 
which we call the position modulation code, for rewriting a 
write-once memory. The scheme is built upon a simple yet 
fundamental observation: If we flip k cells out of n binary 
cells that are initially zero, there are n-choose-fc ways of 
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Fig. 1. Two wits are used to implement a symbol that can take four values 
{0, a, b, e}. The all-one value is used to represent the "erased" state. 



doing so, which can represent n-choose-fc distinct messages. 
We call this encoding method position modulation. Clearly, 
position modulation can be directly applied to the first write in 
write-once memories, because we start from the all-zero state. 
Several wom-codes can be viewed as performing position 
modulation in the first write. However, it is not straightforward 
to apply position modulation to subsequent writes, because 
each write flips some wits, and the decoder only sees the 
overall set of written wits but not when they are written. 

The scheme proposed in this paper performs position mod- 
ulation to all writes except the last one. Instead of directly 
applying position modulation to the wits, we apply position 
modulation to symbols that are formed by multiple wits each. 
For example, as illustrated in Figure [T] using every two wits 
in a group, we obtain a symbol that can take four values. 
The all-one value is used to represent the "erased" state. At 
the beginning of each write, we erase all nonzero symbols 
by setting them to the all-one value. After this operation, the 
remaining symbols are all zero and thus we can apply position 
modulation. 

The rest of the paper is organized as follows. In Section HIl 
we present a brief overview of existing research on wom 
coding and related topics. In Section Hill we highlight a simple 
observation that encoding and decoding for position modula- 
tion can be implemented with polynomial time complexity. 
The proposed position modulation code is then described in 
Section |IV] In Section [V] we compare the performance of 
the position modulation code with existing wom-codes. In 
Section fVll we present the conclusions. 

II. Overview of Related Studies 

We briefly overview the existing studies on wom codes. 
They will be referred to with more details later in the paper 
when we compare the performance of different codes. Rivest 
and Shamir defined wom codes in their pioneering paper |18|, 
in which they presented a number of individual wom codes 
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(the most well-known of which is the code for writing two 
bits twice in three wits) and some families of wom codes 
(including the linear wom codes and the tabular codes). The 
wom code model was generaUzed in H by allowing the 
cell state transitions to be any acyclic directed graph; in 
addition, two families of wom codes for data with alphabet 
size three and four were shown. In lfT6l several individual wom 
codes were constructed using projective geometries. In f3| a 
wom code construction based on eiTor-correcting codes was 
presented. As an example, based on the C[23, 12, 7] Golay 
code, a wom code for writing 11 bits three times in 23 wits 
was shown. More works on wom include ||6|, fg], [|2T|. Il22l . 
which studied the capacity and error correction of write-once 
memories. 

WOM is related to the study on defective memory ||8l, ifTOl . 
lfT4l . write unidirectional memory ifTTl . lfT9l . Il20l . and write 
efficient memory (T\, fT\. It is also related to the study on 
coding for flash memories, where many of the proposed coding 
schemes are based on the monotonic transitions of flash cell 
states fm . ifTSll . In particular, the works on rewriting codes 
for flash memories extend wom codes |!2], IS), lfT2l . ifTSl . 

The motivation for this study is to look for a general method 
for constructing wom codes that can achieve low encod- 
ing/decoding complexity and high rates, which can potentially 
be used in practice (e.g., for flash memories). As a result, our 
focus is on cases where the number of rewrites is reasonably 
small and the data size is moderate, instead of asymptotic 
settings. 

III. Position Modulation Has Polynomial 
Complexity 

We use the following notations for wom codes in ifTSl . 
A {vY/n womcode is a code that can write a variable 
of cardinality v t times, using n wits. More generally, a 
{vi,. . . ,Vt)/n womcode is a code that can write a variable 
of cardinality vi the first time, a variable of cardinality V2 the 
second time, and so on, using n wits. The i-th write is also 
called the i-th generation. The rate of the code is: 



l0g2(t^l 



■Vt 



(1) 



In position modulation, we use a length-?! binary vector 



with k ones to represent a variable of cardinality 



Let 



U denote the set of all length-n binary vectors with k ones. 
The most natural approach is to associate each vector u E U 
with its index in the lexical sorted list of U. Let 



£: {0,1, 



1 



(2) 



be a function that computes the lexical order of a vector 
u eU. We now show a simple observation, that the encoding 
and decoding functions £(•) and can be implemented 

efficiently. 

First, consider an example where n — 1 and fc = 3. 
Figure |2] illustrates the process of computing the lexical order 
of the sequence 0101100. The idea is to count the number of 
sequences that have a lower index. We start with the leftmost 



0100*** 
01010** 



Fig. 2. Illustration of the process of computing the lexical order of the 
sequence 0101 100. 



1 and move to the right. First, if we flip the leftmost 1 to 
0, then clearly vectors in U of the form QQ * * * are all 



lexically before 0101100; there are ^ ^ J of such vectors. If 

we keep the second bit as 0, the next group of vectors with a 

lower index than 0101100 is of the form 0100 * There are 
3 \ 

2 j of such vectors. If we keep the first two I's, the next 
group of vectors with a lower index than 0101100 is of the 



form 01010 * *. There are ( ^ | of such vectors. Therefore, 
the index of 0101100 is 

5(0101100) 



1 



= 15. 



More generally, consider a vector u E U that has k one's at 
positions n > Ji > . . . > > (here the bit positions are 
labeled as to n — 1 from right to left). We have: 



k 



«2 
fc - 1 



Ik 
1 



(3) 



Conversely, given i{u), to find u, we determine the bits 
from left to right. First, ii is determined as the largest integer 

such that I j < i{u). Next, 12 is determined as the largest 



integer such that ( j,*^ ) < (.{u) — 



This process 



k~l ) - ^ ' \ k 
can be continued until all ones have been determined. All the 
above computation has time complexity polynomial in n. 

IV. Position Modulation Code 

Theorem 1: Given integers vi,. . . ,vt and m > 2, let hi 
1 < i < i be integers satisfying 

(a) hi > h2 > ■ ■ ■ > ht > Q. 

^hi-h2 ( hi 
fc 



(b) e:lo 



(2™ - 1)'= > vi. 



fc 



(2™ - 2f > v„ i 



,t-l. 



(d) (2™ - 1)'" -l>vt. 
Then, there exists a {vi, . . . ,vt) / [m x hi) wom-code. 
Proof: We organize the n — mhi wits as hi groups, where 
each group consists of m wits and represents a symbol that can 
take 2™ values. The state of the storage cells is then described 
by a vector x = [xi, . . . , x/ij, where each component can take 
values from {0, 1, . . . , 2™ — 1} (each value corresponds to its 
binary representation). Two of the states are special, the zero 
state implemented by the all-zero codeword and the erased 
state implemented by the all-one codeword. 
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At the beginning of each write, we erase all nonzero 
symbols by setting them to the all-one value. The remaining 
symbols are thus all zeros. Using these remaining symbols, 
we encode the message by the number of zero symbols, the 
positions of the zero symbols, and the values of the nonzero 
symbols. 

We now describe the encoding process in detail. For the 
first write, we select up to hi — /12 symbols and write a 
variable with value from {1, . . . , 2™ — 1} to each of them. If we 
write into k symbols, then the positions of the k symbols can 

represent a variable of cardinality ^ ^ ' ^i^^ the values of 

these k symbols can represent a variable of cardinality (2™ — 
l)*^. Since we can write 0, . . . , /ii — symbols, in total we 
can represent a variable of cardinality 



h\—h2 

E 

A;=0 



hi 
k 



1) 



(4) 



which is at least vi according to condition (b). Hence the first 
write can be done. 

For the i-th write with 1 < i < t, first we erase all 
nonzero symbols by changing them into the all-one state. 
If there are more than hi remaining zero symbols, then we 
arbitrarily erase some additional zero symbols so that there 
are exactly hi remaining zeros. Then among the hi remaining 
zero symbols, we pick 1 up to hi — hi+i entries and write a 
value from {1, ... ,2™ — 2} to each of them. Since we can 
write 1, . . . , /li — hi^i symbols, in total we can represent a 
variable of cardinality 



E 

fc=l 



hi 

k 



(2" - 2f 



(5) 



which is at least Vi according to condition (c). Hence the i-th 
write can be done. 

The last write is done differently in that position modulation 
is not used. First we erase all nonzero symbols and possibly 
some additional zero symbols so that there are exactly ht zero 
symbols. We simply use these ht positions to represent the 
variable by setting each position from {0, 1, ... , 2™ — 2}, 
except that the all zero codeword is not used. Since (2™ — 
1)^' — 1 > t}j^ the last write can be done. 

Decoding is done accordingly. First, the generation number 
(i.e., which write) is decoded from the number of zero symbols 
fco. If fco > h2, then it is the first write. If /12 > fco > h^^, then 
it is the second write; and so on. If ht > fcg, then it is the 
last write. Next, the information message is decoded from the 
number of zeros, the positions of the zeros, and the values of 
the nonzero entries (with erased symbols discarded in all but 
the first generation). ■. 

Since the encoding and decoding for position modulation 
can be done with polynomial complexity (in logui, 
log Vt), the resulting position modulation code has polynomial 
encoding and decoding complexity. 

A. Determining the Code Parameters 

To find the coding parameters, we determine the t numbers 
/ii , . . . , /it in reverse order. First, we choose ht as the smallest 



number that satisfies condition (d). More specifically, we 
choose ht to be: 

" log^fa + l) ! 
log2(2™ - 1) • 

Next, we choose ht-i as the smallest number greater than ht 
that satisfies condition (c); and so on. More specifically, for 
1 < i < i we choose hi to be; 



hi — h 



i+l 



5,. - 



E 

fe=i 



(2™ -2f> V, 



(7) 
(8) 



We choose hi to be the smallest number that satisfies condition 
(b), i.e.. 



hi ^ h2 + 5i, 



61 = min < S 



E 

fc=0 



h2 + S 




Lemma 1: Consider given parameters to, t, and vi = . . . = 
Vt — V. Let 5i — hi — /li+i. The above process (namely 
(fTOb ) will output an increasing sequence ht, ht^i, . . . , hi with 
nondecreasing increments, i.e., 

St-i>St-2>.-.>Si. (11) 
Proof: From (|8]l, we have 



k=l ^ ^ 



Since hi > /i^+i, this implies that 

Ef "^^'0(2" 



and 



fc=i 

Si 

E 

fe=0 



> V. 



hi + 5i 
k 



(2™ - 1)'= > V. 



(12) 



(13) 



(14) 



From ^ and ( fTOb we know that 5i-i < 5i. ■ 

Example 1: Consider a code that can write 56 bits of data 
10 times using 278 wits, where a symbol is formed by m = 2 
wits. The rate of this code is: 

^.2.01... 

139 X 2 

In this case, the code paramters are: 



^10 


= 36, 


hg 


-51, 


hs 


= 64, 


hr 


= 76, 


he 


= 88, 


/15 


= 99, 


/14 


= 110, 


hs 


= 120, 


h2 


= 130, 


hi 


= 139. 
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Note from (l6]l-(fT0]i that if we want to obtain a code that 
writes 56 bits of data for t' > 10 times, then the first 9 numbers 
in the above list will remain the same. 



B. Performance Characterization 

We next present a performance characterization of the 
position modulation code, by comparing it with a performance 
bound on womcodes in |18|. The original bound in L18| is for 
the {vY/n case; however, the argument can easily be extended 
to the {vi, . . . ,vt) /n case, as mentioned in fT6\. The following 
lemma presents the bound for the {vi, . . . , Vt) jn case. 

Lemma 2 (Bound on womcodes lUSl ): 
Let . . . , Vt)) denote the least n for which a 

. . . , tij)/n-womcode exists. Then 



Proof: With m = 2, we have 



w((ui,...,i;f)) > Zt{vx,. . . ,vt). 



(15) 



Here Zt{. . .) is a i- variable function defined recursively as 
follows: 



Zt{vi,. Zt^i{v2, 



,Vt) + S{vi,Zt-l{v2, 

t > 1, 



Zo = 0, 



6{v, m) 



E 



m + 5 
i 



> V 



>«t)): 

(16) 
(17) 

(18) 



Proof: The proof is by induction on t. The case t = is 
trivial. Now consider any (fi, . . . ,vt)/n womcode for t > 0. 
For this code, after the first write, there must be at least 
Zt^i{v2, ■ ■ ■ ,Vt) zeros. (If there are less than Zt_i{v2, . . . ,Vt) 
zeros after the first write, it is not possible to accommo- 
date the remaining t — 1 writes since w{{v2, ■ ■ ■ ,Vt)) > 

Zt-l{v2,...,Vt).) 

Therefore, the first write can only use codewords with at 
most K ^ n — Zt-i{v2, . . . ,Vt) ones. Since the first generation 
needs to represent a variable of size vi, we must have: 



E 

i=0 



(19) 



From the definition of 5{v,m), we see that 

n>5(vi,Zt-i{v2,...,Vt)). (20) 

Thus 

n > Zt-i{v2, ■ . ■ ,vt) + S{vi, Zt-i{v2, . ■ . ,vt)). (21) 

Since (1211 1 holds for any {vi, . . . ,Vt) /n womcode, 
w{{vi,...,vt))> Zt{vi,...,vt). m 

Theorem 2: Given vi, . . . ,vt with Vi > 2, consider 
hi, . . . ,ht determined based on (l6]l-(fT0]i for m ~ 2. Then 



hi < Zt{vi,. ..,«*) < w{{vi, . . .,«()). 



(22) 



Therefore, the position modulation code achieves a rate that 
is at least half the optimal rate. 



l0g2(wt + 1) 



logaS 
ht-i = ht + (5f_i, 



hi = h2 + Si, 



(23) 
(24) 

(25) 



where 

6t-i = min <|(5 

5t-2 = min |(5 
Si = min < S 



E V k 

k=l 
S 



'''^^^2'^>vt-i\, (26) 



E 

k=l 



E 

fc=0 



'*l+M2^>.*-2 , (27) 



(28) 



By expanding the expression of Zt{vi, . . . , Vt), we have 



Zi{vt)=S{vt,0) 
Z2{vt-i,vt) = Zi{vt) + S't_i 



(29) 
(30) 

'■■ (31) 

Zt{vi, ...,vt)^ Zt-i{v2, ...,vt)+ S'l, (32) 



where 



S't-i = S {vt-i, Zi{vt)) 

St_2 = S {vt-2,Z2ivt-l,Vt)) 



S'l = S {vi,Zt-l{v2, ■ ■ ■,Vt)) . 



(33) 
(34) 

(35) 



To simplify notations, we use the shorthand notation Zk to 
refer to Zk{vt-k+i, ■ ■ ■ , Vt). We now show via induction over 
k that 

ht-k+i < Zk. (36) 
The case k = 1 follows from the fact that for vt > 1, 

"logaK + 1)" 



5{vt,0) = \\0g2Vt] > 



log2 3 



= ht. (37) 



Suppose ht-k+i < Zk. We now show that ht-k < -^fc+i- 
Note that S[_^ > 1 and 



E 

k=l 

E 

A:=0 



Zk+1 



2'^>E 



k=0 



Zk+i 



> vt-k (38) 



k=0 
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Since > ht-k+i, Zu + 5[_k - ht-k+i > Thus 

^ ( ^^^+M2'=>«*-. (40) 

k = \ ^ ' 

fc=0 ^ ' 

Since Z^+i = Z^. + (5J_j. - /it_fc+i + Ih-k+i, from (|26ll-(|28]l 
we see that 

(^t-fc < Zk+ S't_k - ht-k+i, (42) 

or equivalently, 

ht-k < Zk+i. (43) 
Thus, using induction we can show that hi < Zt{vi, . . . , Vt). 

m 

V. Performance Comparison 

In this section we compare the performance of the position 
modulation codes with existing wom-codes. For the position 
modulation code, there are some cases where setting m = 
3 gives slightly better performance than setting m ~ 2. For 
example, to write 56 bits twice, the position modulation code 
needs 98 wits with m = 2, and 96 wits with to = 3. However, 
setting TO = 2 generally gives good performance among all 
choices of m. In the following, we use to 2 for the position 
modulation code. 

Table U gives the performance comparison of position mod- 
ulation code (we use v — 2^^) and known low complexity 
{vY /n codes with the best rates, for i = 2, . . . , 10. The known 
low complexity codes with the best rates are given in 

the top row of the table. The position modulation codes and 
their rates are given in the bottom row of the table. 

We now explain the codes in the top row from left to right. 
The (26)^/7 code is presented in ifTSl and this code is found 
via computer search, according to fTSl. The (63)'^/12 code is 
from James B. Saxe's construction of a (65, 81, 63)/12 code, 
which was mentioned in fTSl. There are two constructions 
of {1)^/1 codes. One is a cyclic womcode constructed by 
David Leavitt (mentioned in ifTSl ) and the other is a womcode 
constructed from projective geometry by Frans Merkx |;16J . 
The (11)^/11 womcode, designed by M. Beveraggi, is based 
on Steiner pentagonal systems, according to Q. There are 
two constructions of the (16)^/15 code. One is the linear 
coset code given in O (Proposition 5 in O). The other is 
the linear code given in ifTSl . The (15)^/15 womcode is also 
constructed from projective geometry 1161 . The next three 
codes are all obtained by combining the (15)^^/15 womcode 
with some other small womcodes, since there are no known 
specific designs for t = 8,9,10 with rates higher than 1.62. 
This is done by using two observations given in ifTSl . First, 
by concatenating a code and a (w)*^/7i2 code side 

by side, we can have a {vY^'^^^ / [rii +712) code (Information 
is represented by the modulo-w sum of the values of the two 
subcodes). Second, by concatenating a {vi)*- /ni code and a 



{v-if 1^2 code side by side, we obtain a {v\ ■ V2Y /{ni + 712) 
wom-code (Information is represented as the ordered pair of 
the two values). We obtain the (15)^/19 code by concatenating 
the (15)^/15 code with a (15)^/4 code (simply using the 4 
bits to represent a 15-ary variable). We obtain the (15)^/21 
code by concatenating the (15)^/15 code with a (15)'^/6 code 
(based on the (16)^/6 code given in lITSl ). We obtain the 
(15)^°/24 code by concatenating the (15)^/15 code with a 
(15)'^/9 code, which is obtained by concatenating a (5)'^/5 
code and a (3)^/4 code. The (5)'^/5 code is a cyclic code 
designed by David Klarner (see [18|) and the (3)'^/4 code is 
given in |4|. 

For each i e {2, . . . , 10}, the best rate is shown in boldface. 
It is seen that the position modulation code offers higher rates 
than state-of-the-art solutions for i = 5, 6, 8, 9, 10. 

For larger values of t, we compare the position modulation 
code with three existing general code designs that produce 
classes of codes. In [18.1 , Rivest and Shamir presented a 
— 1) linear code. The rate of this code is less 
than 2 for t < 50. In fA\, Fiat and Shamir presented a 
(3)*/(i + 1) code that works for arbitrary t. The rate of this 
code is always less than 1.59. In |3|, Cohen et al. presented 
a (2'')^'^ '+^/(2'' — 1) code that works for r > 4; according 
to the paper, the encoding process is NP-hard. Figure [5] shows 
the rates of these wom-coding schemes, where the position 
modulation code is for to = 2 and v = 2^^. It is seen that the 
position modulation code achieves higher rates. 

VI. Conclusion 

We presented a method for constructing worn codes with 
low encoding and decoding complexity and high rates, which 
works for general problem configurations. The proposed 
method is called position modulation code, as it is built upon 
a simple and yet fundamental observation, that positions of 
zeros in a binary array can be used to encode information. 
The position modulation code applies position modulation 
to symbols formed by multiple cells and performs position 
modulation in all writes except the last one, by using a "soft" 
erasing operation to reset the state between writes. 

It is proven that the position modulation code achieves a 
rate that is at least half of the optimal rate. Furthermore, the 
proposed position modulation codes is seen to have superior 
rates compared to existing wom codes. The low complexity 
and high rate of the position modulation code make it a 
promising candidate for practical applications. 

In addition, as a family of codes that accommodate general 
parameter configurations, the position modulation code can 
be readily used in related rewriting schemes. For example, 
the trajectory code |T2l for a generalized rewriting model 
can use the position modulation code as a subcode for better 
performance. 
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